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ABSTRACT 

To study the applicability of simple structure logic 
for factorial data from scientific disciplines outside psychology, 
four correlation matrices from each of six scientific areas were 
factor analyzed by five factorxng methods. Resulting factor matrices 
were compared on two objective criteria of simple structure before 
and after rotation. Factoring methods differed in their ability to 
provide a simple structure for a given set of data. Scientific 
disciplines differed in the extent to which they provided data which 
could be factor analyzed, but not in the simple structure of factors 
obtained. Varimax rotation did not systematically improve simple 
structure. (Author) 
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Method of Extraction and Simple Structure of 



Data from Diverse Scientific Areas 
Robert M. Thorndike 
Western Washington State College 

To study the applicability of simple structure logic for factorial 
data from scientific disciplines outside psychology! four correlation 
matrices from each of six scientific areas were factor analyzed by five 
factoring methods. Resulting factor matrices were compared on two objective 
criteria of simple structure before and after rotation. Factoring methods 
differed in their ability to provide a simple structure for a given set of 
data. Scientific disciplines differed in the extent to which they provided 
data which could be factor analyzed, but not in the simple structure of 
factors obtained. Varimax robation did not sys .ematically improve simple 
structure. 
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Method of Factor Extraction and Simple Structure , _ 

of- Data from Diverse Scientific Areas 
Robert M. Thorndike 
Western Washington State College 

The search for simple structure in sets of variables has assumed an 
important position psychological applications of factor analysis. It is 
reasonable to question whether the concept of simple structure deserves such 
a lofty place in factor analytic practice. This question has become in- 
creasingly appropriate with the growing use of factor analysis by researchers 
in disciplines outside psychology. Can the logical procedures which 
psychologists have developed for their own problems be applied to data 
from Qthejr- areas of- science? The present study examines several issues 
gepmain to the general use of factor analysis and the logic of simple 
structure. 

Method 

Tp, study the effects of changes in method of factor extraction and source 
pf data on simple structure, five factoring models were applied to data 
from six scientific %reas. The five methods and the type of reduction of 
the correlation matrix used by each are described in detail elsewhere 
(Thorndike, 1??0). Briefly, the five methods were principal components • 

with Kaiser criterion, minimum residuals, maximum likelihood, image 
qovariance, and alpha. Data were obtained from the literature in the areas 
of medicine, economics, ability measurement, personality measurement, 
sociology, and iaxonoiry. Four examples of data from each area were 
analyzed by each factor model. 

Criteria of Simple Structure 

Objectification of simple structure has been a problem for factor 
analysts since Thurstone proposed that factors should be rotated to a simple 
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structure solution. The proportion of variables located in the hyperplane 
(the hyperplane count) has traditionally been used as a criterion. However, 
this method tends to locate, define, and evaluate factors by what they are 
not . 



A revision of the simple structure model which is based on positive 
information and which provides an objective index of fit to the model has 
recently been proposed (Thorndike, 1971). Briefly, the revision states that 
maximum simplicity of structure is obtained when each variable loads on only 
one of the factors and all of its other loadings are zero. The index of 
goodness -of-f it to this model is given by 
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where h? = the communality of variable i. 

and.a^j = the factor loading of variable i_ oh factor 
It has a range of 0 to 1.0 and gives at least ordinal information. The 
results from applying this index and the hyperplane count uu w.o 2^ . ets 
of data for each of the analyses are given' in Tables 1 through. 3. 

Results 

Tables 1 and 2 contain the values of the criteria for adequacy of 
approximation to simple structure which were described above. The values 
of G and proportion of hyperplane loadings in Table 1 are for the unrotated 
matrices while those in Table 2 are for the matrices after varimax rotation. 
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Insert Table 1 about here 



The omissions in the tables indicate that a solution could not be 
obtained for that set of data by that method of extraction (for a discussion, 
of these results see Thorndike, 1970). It is interesting that only the 
psychological areas yield data which are generally analyzable by all methods. 
We may also note that the principal components and minimum residual methods 
provide solutions for all cases. 

The first aspect of the data which is apparent in both tables is that, 
with the exception of the results for some image matrices, the value of G 
always exceeds the proportion of variable loadings in the hyperplane. The 
reason that the hyperplane count exceeds G for some of the image matrices 
appears to be that one or more very small factors are retained by the 
decision rule used for the image analyses. These sma -1 ! fac 4- air 

all or nearly all hyperpla.— ^oadang-, , .resulting in a substantially higher 
hyperplane count for. the image matrices than for those of any other methvdl 
In every case at least half of the image loadings before rotation are i 
the hyperplare. Tne values for G are also unusually high, but they have 
not been raised as much as the hyperplane count by the inclusion of 
essentially null factors. 

Close attention to Table 1 reveals some interesting relationships 
among the factor methods. In comparing principal components with minre ; 
on the value o: G, _t may be seen that minres is superior in 18 of "the 1 4 
cases, indicating that minres provides a better fit to the revised simple 
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structure model than principal components does. The same conclusion is 
reached in regard to the traditional simple structure model and its 
criterion, the hyperplane count. Minres is superior in 20 of 23 cases 
and there is one tie. Expanding the comparison to include alpha, the three 
methods can "be ordered on the sums of therr ranks on the criteria for the 
IT cases providing data. Again it will he seen that minres provides the 
best fit to either simple structure model. Alpha performs less well than 
minres 5 but better than principal ocmponents. When all five methods are 
compared for the 11 relevant cases, image is seen to give the best solution 
in almost every case. On the basis of the two criteria to fit to the 
simple structure models it is definitely superior to any other method. 
However, this finding is also an artifact of the tendency of the image 
method to retain null factors, as pointed out before. (It is worth noting 
that null factors do not appear to distort G as much as they do the hyper- 
plane count.) A more interesting finding is that maximum likelihood ranks 
second, slightly ahead of minres. The two statistical methods, maximum 
likelihood and minres, provide the best fit to simple structure when the 
spuriously inflated values of image analysis are ignored. Alpha and 
prdnicpal components retain their relative positions. 



Insert Table 2 about here 



From a comparison of Table 1 with Table 2 it may be seen that the 
various factor methods are differentially affected by varimax rotation. 
The superiority of minres over principal components is less marked for 
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the rotated matrices than for the unrotated matrices on both criteria, 
with the G index showing a greater gain than the hyperplane count for 
the principal components method. On the second comparison there is 
little difference between minres and principal components on G, and alpha 
is decidedly third, while the hyp erp lane count orders the methods: minres, 

alpha, principal components. With all methods included, image analysis 
remains superior for the same reason cited above. According to G, principal 
componenets and maximum likelihood tie for second, minres is next, and 
alpha perforins most poorly. “Hie hyperplane count suggests a different 
ordering. Image remains first, followed by minres and maximum likelihood, 
tied. Alpha is next and principal components is least satisfactory. The 
primary differential effect of rotation is upon the fit of the principal 
components matrices to the revised simple structure model. Rotation 
improves principal components relative to the other methods. Why this 
should be so is not immediately clear. It is noteworthy that the effect 
is observed only for the revised simple structure model. 

Aside from their relative magnitudes and fhirly consistent ordering 
of the methods with regard to adequacy, there does not appear to be a 
consistent;.- relationship between the two criteria. In only about 75$ of 
cases did they agree on whichiinitial solution was best from a simple 
structure viewpoint, and half of those agreements were for the image 
solution, which has a tendency to inflate both criteria because of 
the presence of null factors. Essentially the same rates of agreement were 
found for the rotated matrices and these agreements could also be attributed 
largely to the presence of the image solution. When the values for the 
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image matrices are omitted from consideration, the rate of agreement drops 
below 30 %. Thus, the two indices do not appear to agree at much above 
a chance level. It should be noted, however, that often the index values 
for different methods were not very different. Values of the two criteria 
may be differently affected by changes in the number of factors, possibly 
accounting for the lack of agreement. 

Rotation 

The effect of rotation on the values of the two indices is shown in 
Table 3. Entries are the amount of increase in each index due to rotation, 
negative values indicating a decrease in the index. It is obvious there 
is no consistent effect of rotation on either G or the hyperplane count. 



Insert Table 3 about here 



However, both indices generally agree on the effect of rotation for a single 
matrix and usually also agree for all matrices for a given set of data. 

There is no consistent effect of rotation for all of the data from any 
scientific area. Even in the psychological areas, where an analytic 
rotation is often taken as the final solution, there is no consistent 
tendency for varimax rotation to improve on the simple structure of the 
initial solution as it is evaluated by the criteria used in this study. 

The only relatively consistent finding which appears in Table 3 is 
that variamax rotation, with one exception, makes the simple structure of 
factors obtained from image analysis worse. This is probably due to the 
fact that most of the original image matrices included several null 
factors which would yield inflated values for both criteria. A varimax 
rotation would tend to build up slightly these null factors, reducing the 
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number of hyperplane loadings and causing a decrease in G because of the 
increasing number of non-zero and non-maximum loadings. The other methods/ 
which do not generally retain null factors, do not show thi^ consistent 
decrease in simple structure as a result of rotation. 

There is a slight tendency for data from non-psychological areas to 
show better simple structure than do psychological data. This tendency 
is found for both criteria and it may therefore be inferred that the 
logic of simple structure is applicable to data from all areas of science 
included in this study. However, the most important observation which 
can be made is that the correlation matrices resulting from non-psychological 
areas of science ,cannot, in general, be analyzed by all factoring methods. 

The researcher in non-psychological areas would probably be well advised 
to select a’ principal components solution, even though it may not result 
in an optimum simple structure. 

Discussion 

There are few empirical grounds in this study on which to compare 
the two simple structure criteria. However, the logic by which they were 
derived does permit some evaluation of the information which may be conveyed. 
The hyperplane count w as derived from Thurstone's criteria for rotation 
to simple structure. It yields information on the proportion of loadings 
which are within some specified range of zero. The assumption is that the 
remaining loadings will be large and meaningful. However, basing a 
criterion for adequacy of the obtained solution on negative information 
seems a Questionable practice, just as rotating by finding hyperplanes 
with the maximum number of informationless variables is questionable. 
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By contrast, the proposed simple structure index, G, and its associated 
model have logical appeal because they treat as the simplest structure 
that structure which is most simple. The model is readily quantified in 
a manner which makes maximum use of the information, both positive and 
negative. Both the revised model and the index, G, give meaningful 
results for cases where a single factor is the appropriate solution. 

Also, and perhaps more importantly, values of G do not seem to be distorted 
q.ite so much by the imaige method's tendency to retain null factors. 
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Table 1 
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